2 a Nonparametric Multi Class Partitioning Method for Classification
نویسندگان
چکیده
c classes are characterized by unknown probability distributions. A data sample containing labelled vectors from each of the c classes is available. The data sample is divided into test and training samples. A classifier is designed based on the training sample and evaluated with the test sample. The classifier is also evaluated based on its asymptotic properties as sample size increases. A multi class recursive partitioning algorithm which generates a single binary decision tree for classifying all classes is given. The algorithm has the same desirable statistical and computational properties as Friedman's (1977) 2-class algorithm. Prior probabilities and losses are accounted for. A tree termination algorithm which terminates binary decision trees in a statistically optimal manner is given. r,or~ don and Olshen's (1978) results on the asymptotic Bayes risk efficiency of 2-class recursive partitioning algorithms are extended to the c-class case and applied to the combined partitioning/termination algorithm. Asymptotic efficiency and consistent risk estimates are obtained with independent test and training sequences. ACKNOWLEDGEMENT I would like to thank Professor Sanjoy Mitter for his intellectual and financial support throughout the course of this research. I would also like to acknowledge Don Gustafson whose ideas contributed heavily to the initial phase of the work.
منابع مشابه
تحلیل ممیز غیرپارامتریک بهبودیافته برای دستهبندی تصاویر ابرطیفی با نمونه آموزشی محدود
Feature extraction performs an important role in improving hyperspectral image classification. Compared with parametric methods, nonparametric feature extraction methods have better performance when classes have no normal distribution. Besides, these methods can extract more features than what parametric feature extraction methods do. Nonparametric feature extraction methods use nonparametric s...
متن کاملExploiting Associations between Class Labels in Multi-label Classification
Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...
متن کاملMULTI CLASS BRAIN TUMOR CLASSIFICATION OF MRI IMAGES USING HYBRID STRUCTURE DESCRIPTOR AND FUZZY LOGIC BASED RBF KERNEL SVM
Medical Image segmentation is to partition the image into a set of regions that are visually obvious and consistent with respect to some properties such as gray level, texture or color. Brain tumor classification is an imperative and difficult task in cancer radiotherapy. The objective of this research is to examine the use of pattern classification methods for distinguishing different types of...
متن کامل1 A NONPARAMETRIC MULTICLASS PARTITIONING METHOD FOR CLASSIFICATION by SAUL BRIAN GELFAND
c classes are characterized by unknown probability distributions. A data sample containing labelled vectors from each of the c classes is available. The data sample is divided into test and training samples. A classifier is designed based on the training sample and evaluated with the test sample. The classifier is also evaluated based on its asymptotic properties as sample size increases. A mul...
متن کاملMulti-class Classification: A Coding Based Space Partitioning
In this work we address the problem of multi-class classification in machine learning. In particular, we consider the coding approach which converts a multi-class problem to several binary classification problems by mapping the binary labeled space into several partitioned binary labeled spaces through binary channel codes. By modeling this learning problem as a communication channel, these cod...
متن کامل